Abstract

We demonstrate that the gain/loss asymmetry observed for stock indices
vanishes if the temporal dependence structure is destroyed by scrambling the
time series. We also show that an artificial index constructed as a simple
average of a number of individual stocks displays gain/loss asymmetry, which
allows us to explicitly analyze the dependence between the index constituents.
We consider mutual information and correlation based measures and show that
the stock returns indeed have a higher degree of dependence in times of market
downturns than upturns.
Introduction

Inspired by research in the field of turbulence, Simonsen, Jensen, and
Johansen [1] considered "inverse statistics" of financial time series: what is
the smallest time interval needed for an asset to cross a fixed return level
$p$? Figure 1 shows the distribution of this random variable, the first passage
time, for the Dow Jones Industrial Average index, for $p = \pm 5\%$. As noted
by Jensen, Johansen, and Simonsen [2], the most likely first passage time is
shorter for $p = -5\%$ than for $p = 5\%$, which they refer to as the gain/loss
asymmetry.

In this paper, we show that the gain/loss asymmetry in the Dow Jones index
vanishes if the time series is "scrambled", that is, if one considers a new
time series constructed by randomly permuting the returns. This basic fact,
which seems to have gone unnoticed in the literature so far, has important
implications: the gain/loss asymmetry is not due to properties of the
unconditional index returns, like skewness, but is rather an expression of
potentially complex temporal structure. This finding resonates with the results
from Siven, Lins, and Lundbek Hansen [3], where wavelet analysis is used to
demonstrate that the gain/loss asymmetry is a long time scale phenomenon: it
vanishes if enough low frequency content is removed from the index, that is, if
the index is sufficiently "detrended." Siven, Lins, and Lundbek Hansen [3] also
present a generalization of the asymmetric synchronous market model from
Donangelo, Jensen, Simonsen, and Sneppen [4], where prolonged periods of high
correlation between the individual stocks during index downturns give rise to a
gain/loss asymmetry.

Figure 1: Estimated distribution of the first passage time $\tau_p$ for the log
price of the Dow Jones Industrial Average index (left) and its scrambled version
(right). The graphs correspond to $p = +5\%$ (stars) and $p = -5\%$ (rings). The
solid lines are fitted generalized gamma density functions.

Whether the constituents of e.g. the Dow Jones index indeed tend to move 
with a greater degree of dependence during market downturns could in principle 
be tested empirically by analysis of the time series of the individual stocks. That 
is an awkward task, however, since the relative weights for different stocks in 
these indices have changed over time in complicated ways. To address this issue, 
we demonstrate that if one defines a new, artificial index by simply taking the 
average of a number of stocks, this index also displays gain/loss asymmetry. 
With the constituents readily available, we consider two measures based on 
correlation and mutual information, and show that there indeed is a higher 
degree of dependence between the stock returns during index downturns than 
upturns. 

Gain/loss asymmetry and temporal structure 

For a given process $\{I_t\}_{t \geq 0}$, for instance daily closing prices of
a stock index, the first passage time $\tau_p$ of the level $p$ is defined as

\[
\tau_p =
\begin{cases}
\inf\{s > 0 : \log(I_{t+s}/I_t) \geq p\} & \text{if } p > 0, \\
\inf\{s > 0 : \log(I_{t+s}/I_t) \leq p\} & \text{if } p < 0,
\end{cases}
\]

and is assumed to be independent of $t$. The distribution of $\tau_p$ is
estimated in a straightforward manner from a time series $I_0, \ldots, I_T$.
Consider $p > 0$, fix a time point $t$, and let $t + s$ be the smallest time
point such that $\log(I_{t+s}/I_t) \geq p$, if such a time point exists. In
that case, $s$ is viewed as an observation of $\tau_p$. (If $p < 0$, take
instead $t + s$ such that $\log(I_{t+s}/I_t) \leq p$.) Running $t$ from $0$ to
$T - 1$ gives a set of observations from which the distribution of $\tau_p$ is
estimated as the empirical distribution. Given the empirical distribution, we
follow Jensen, Johansen, and Simonsen [2] and compute a fit of the density
function for the generalized gamma distribution. This density is plotted as a
solid line together with the empirical distribution in all figures to guide the
eye; we do not discuss the fitted parameters, nor claim that $\tau_p$ truly
follows a generalized gamma distribution.
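The estimation procedure described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `first_passage_times` and `empirical_distribution` are hypothetical, the non-strict inequalities are assumed, and the generalized gamma fit is left out.

```python
import math
from collections import Counter


def first_passage_times(prices, level):
    """Collect observations of the first passage time for a log-return level.

    prices: sequence I_0, ..., I_T of positive prices.
    level:  return level p, e.g. 0.05 or -0.05 (assumed non-zero).
    For each start day t = 0, ..., T-1 we record the smallest s > 0 with
    log(I_{t+s} / I_t) >= p (if p > 0) or <= p (if p < 0). Start days from
    which the level is never reached contribute no observation.
    """
    if level == 0:
        raise ValueError("level must be non-zero")
    log_p = [math.log(x) for x in prices]
    observations = []
    for t in range(len(log_p) - 1):
        for s in range(1, len(log_p) - t):
            move = log_p[t + s] - log_p[t]
            if (level > 0 and move >= level) or (level < 0 and move <= level):
                observations.append(s)
                break
    return observations


def empirical_distribution(observations):
    """Relative frequency of each observed passage time, keyed by s."""
    counts = Counter(observations)
    total = len(observations)
    return {s: counts[s] / total for s in sorted(counts)}
```

Calling `first_passage_times(prices, 0.05)` and `first_passage_times(prices, -0.05)` on the same series gives the two samples whose empirical distributions are compared in the figures. The double loop is quadratic in the worst case, which is acceptable for a sketch but may be slow for decades of daily data.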

Gain/loss asymmetry for the Dow Jones index 

Figure 1 shows the estimated first passage time distribution for the Dow Jones
index. As discussed in the Introduction, there is a gain/loss asymmetry in that
the most likely first passage time is shorter for $p = -5\%$ than for
$p = 5\%$. Next, we construct a scrambled version of the index by randomly
re-arranging the log returns. Formally, if $I_0, \ldots, I_T$ denotes the time
series of daily closing prices of the Dow Jones index, let
$\delta I_t = \log(I_t/I_{t-1})$ for $t = 1, \ldots, T$ and draw a random
permutation $j_1, \ldots, j_T$ of $\{1, \ldots, T\}$. We define

\[
\delta \tilde{I}_t = \delta I_{j_t} \quad \text{for } t = 1, \ldots, T,
\]

and let the scrambled index be given by $\tilde{I}_0 = I_0$ and

\[
\tilde{I}_t = \tilde{I}_{t-1} \exp(\delta \tilde{I}_t) \quad \text{for } t = 1, \ldots, T.
\]

Figure 1 shows that the scrambled index does not display a gain/loss asymmetry.
This result is surprisingly strong: since the empirical return distributions
are identical for an index and any of its scrambled versions, it shows that the
gain/loss asymmetry is an expression of potentially complex temporal structure
in the index. This fits nicely with the results from Siven, Lins, and Lundbek
Hansen [3], where a multiscale decomposition is used to demonstrate that the
gain/loss asymmetry is a long rather than short scale phenomenon.

The gain/loss asymmetry in the asymmetric synchronous market model from
Donangelo, Jensen, Simonsen, and Sneppen [4] does not disappear when the index
returns are scrambled. This is to be expected, since the daily returns in that
model are independent and identically distributed, so all statistical properties
remain the same when the time series is scrambled. However, for the generalized
model proposed in Siven, Lins, and Lundbek Hansen [3] the asymmetry does vanish,
in perfect agreement with the Dow Jones index; see Figure 2.

Figure 2: Estimated distribution of the first passage time $\tau_p$ for the log
price in a realization of the generalized asymmetric synchronous market model
from Siven, Lins, and Lundbek Hansen [3] (left) and its scrambled version
(right). The graphs correspond to $p = +5\%$ (stars) and $p = -5\%$ (rings). The
solid lines are fitted generalized gamma density functions.

Gain/loss asymmetry for an artificial index

Consider $N$ stocks, and let $S_{n,t}$ denote the closing price of the $n$th
stock on day $t$, for $t = 0, 1, \ldots, T$. We consider the artificial index
constructed by averaging all the stocks,

\[
I_t = \frac{1}{N} \sum_{n=1}^{N} \frac{S_{n,t}}{S_{n,0}}.
\]

The denominators $S_{n,0}$ give all stocks equal weight in the index at time
$t = 0$.
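As a small illustration of this construction, the equally weighted index can be computed as follows. The sketch assumes the prices are given as a list of equally long series, one per stock; the function name `artificial_index` is hypothetical.

```python
def artificial_index(stock_prices):
    """Equal-weight index I_t = (1/N) * sum_n S_{n,t} / S_{n,0}.

    stock_prices: list of N price series, each holding S_{n,0}, ..., S_{n,T}.
    Dividing by the initial price S_{n,0} normalizes every stock to 1 at
    t = 0, so all stocks start with the same weight.
    """
    n_stocks = len(stock_prices)
    if n_stocks == 0:
        raise ValueError("need at least one stock")
    length = len(stock_prices[0])
    if any(len(series) != length for series in stock_prices):
        raise ValueError("all price series must have the same length")
    return [
        sum(series[t] / series[0] for series in stock_prices) / n_stocks
        for t in range(length)
    ]
```

By construction the index equals 1 at $t = 0$, regardless of the price levels of the individual stocks.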

We consider historical stock prices from January 1970 until December 2008
for the following 12 Dow Jones constituents: Boeing Co. (BA), Citigroup Inc. (C),
E.I. du Pont de Nemours & Co. (DD), General Electric Co. (GE), General Motors
Corporation (GM), International Business Machines Corp. (IBM), Johnson &
Johnson (JNJ), JPMorgan Chase & Co. (JPM), The Coca-Cola Company (KO),
McDonald's Corp. (MCD), Procter & Gamble Co. (PG), and Alcoa Inc. (AA).
These companies are chosen since long time series of stock returns are available,
but our results are stable in the sense that adding or removing companies gives
very similar results.

Figure 3 shows that the index constructed from these stocks displays a
gain/loss asymmetry, much like the Dow Jones index, and that the asymmetry
vanishes if we scramble the time series.
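The scrambling step, applied here to both the Dow Jones index and the artificial index, amounts to permuting the daily log returns and rebuilding a price path from them. A minimal sketch, assuming a plain list of prices; the function name `scramble_index` and the `seed` parameter are illustrative choices, not part of the original method description:

```python
import math
import random


def scramble_index(prices, seed=None):
    """Rebuild a price path from randomly permuted daily log returns.

    The scrambled series starts at the same value as the original and has
    exactly the same set of log returns, only in a different order, so the
    unconditional return distribution is unchanged.
    """
    rng = random.Random(seed)
    log_returns = [
        math.log(prices[t] / prices[t - 1]) for t in range(1, len(prices))
    ]
    rng.shuffle(log_returns)  # random permutation of the T returns
    scrambled = [prices[0]]
    for r in log_returns:
        scrambled.append(scrambled[-1] * math.exp(r))
    return scrambled
```

Because the log returns sum to the same total in any order, the scrambled series also ends at the original final value (up to rounding); only the path in between changes.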

In what follows, we will use this artificial index as a kind of proxy for a real 
stock index. This has the advantage that the individual index constituents are 
readily available for analysis. This is unlike the Dow Jones index for which the 
relative weights and indeed the set of constituents have changed over time. 
Figure 3: Estimated distribution of the first passage time $\tau_p$ for the log
price of the artificial index $I$ (left) and its scrambled version (right). The
graphs correspond to $p = +5\%$ (stars) and $p = -5\%$ (rings). The solid lines
are fitted generalized gamma density functions.

Dependence between constituents during periods of index upturns 
and downturns 

Here, and in what follows, $\{I_t\}_{t=0,\ldots,T}$ denotes the artificial
index defined in the previous section.

Inspired by the generalized asymmetric synchronous market model from
Siven, Lins, and Lundbek Hansen [3], our general intuition is that the individual
stocks tend to "move together" to a greater degree during index downturns than
during upturns, resulting in more violent downturns than upturns. To quantify
this, we first divide the price history of our artificial index $I$ into two
parts, corresponding to upturns and downturns, respectively.

Fix a window length $L$ and consider the index return over the $k$th window,

\[
\Delta I_k = I_{kL} - I_{(k-1)L},
\]

for $k = 1, \ldots, \lfloor T/L \rfloor$, where $\lfloor x \rfloor$ denotes the
largest integer smaller than or equal to $x$. We define the set of indices for
which the daily returns belong to a window over which the index went up,

\[
U = \bigcup_{\{k :\, \Delta I_k > 0\}} \{(k-1)L + 1, \ldots, kL\},
\]

respectively went down,

\[
D = \bigcup_{\{k :\, \Delta I_k < 0\}} \{(k-1)L + 1, \ldots, kL\}.
\]

Note that the sets $U$ and $D$ are disjoint.
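The partition into upturn and downturn days can be sketched as follows. The function name `split_up_down` is hypothetical, and day indices are taken to run from 1 to $T$, matching the indexing of the daily returns.

```python
def split_up_down(index, window):
    """Split day indices into upturn days U and downturn days D.

    index:  sequence I_0, ..., I_T of index values.
    window: window length L (a positive integer).
    Window k covers days (k-1)L + 1, ..., kL. Its days go to U if the index
    rose over the window, to D if it fell, and to neither set if it ended
    unchanged. Days after the last complete window are not assigned.
    """
    if window <= 0:
        raise ValueError("window length must be positive")
    n_days = len(index) - 1  # T
    up_days, down_days = set(), set()
    for k in range(1, n_days // window + 1):
        change = index[k * window] - index[(k - 1) * window]
        days = range((k - 1) * window + 1, k * window + 1)
        if change > 0:
            up_days.update(days)
        elif change < 0:
            down_days.update(days)
    return up_days, down_days
```

Since each window is assigned to at most one of the two sets and the windows do not overlap, the returned sets are disjoint, as noted in the text.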

We will consider two measures of dependence between all the individual
stocks, evaluate them for the returns corresponding to days $t \in U$, and
compare that to the same measures evaluated for days $t \in D$. Before
describing the first measure, the mean of mutual information, we establish some
additional notation. Let the $n$th index be defined by

\[
I_{n,t} = \frac{1}{N-1} \sum_{m \neq n} \frac{S_{m,t}}{S_{m,0}}.
\]

The $n$th index is simply the artificial index constructed by averaging all
stocks except the $n$th. Denote the log return at day $t$ in the $n$th stock and
index by $\delta S_{n,t} = \log(S_{n,t}/S_{n,t-1})$ and
$\delta I_{n,t} = \log(I_{n,t}/I_{n,t-1})$.

Mean mutual information 

The mutual information of two discrete stochastic variables $X$ and $Y$ is
defined as

\[
M(X, Y) = \sum_{x} \sum_{y} p_{XY}(x, y) \log\left( \frac{p_{XY}(x, y)}{p_X(x)\, p_Y(y)} \right),
\]

where $p_{XY}$ denotes the joint and $p_X$ and $p_Y$ the marginal probability
functions of $X$ and $Y$. Mutual information can be written as
$M(X, Y) = H(X) + H(Y) - H(X, Y)$, where $H(X)$ and $H(Y)$ are the marginal
entropies and $H(X, Y)$ is the joint entropy of $X$ and $Y$. It is a measure of
dependence in the sense that $X$ and $Y$ are independent if and only if
$M(X, Y) = 0$. Mutual information can be estimated from a finite set
$\{(X_t, Y_t)\}_{t=1,\ldots,n}$ of joint samples of $(X, Y)$ in a number of
different ways, see Paninski [5]. In the computations below we apply the most
straightforward estimator, the so-called plug-in estimator.
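A plug-in estimate replaces the probabilities in the definition by relative frequencies in the sample. The sketch below illustrates this for paired discrete samples. Since returns are continuous they must first be discretized, and the binning scheme is not specified here, so the equal-width binning in `discretize` and its `n_bins` parameter are assumptions made purely for illustration. Both function names are hypothetical, and the natural logarithm is used, which gives the result in nats.

```python
import math
from collections import Counter


def discretize(values, n_bins):
    """Map real values to bin labels 0..n_bins-1 (equal-width bins, assumed)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it to the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def plugin_mutual_information(xs, ys):
    """Plug-in estimate of M(X, Y) from paired discrete samples.

    Joint and marginal probabilities are replaced by relative frequencies:
    sum over observed (x, y) of p(x, y) * log(p(x, y) / (p(x) * p(y))).
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("need two non-empty samples of equal length")
    n = len(xs)
    joint = Counter(zip(xs, ys))
    count_x = Counter(xs)
    count_y = Counter(ys)
    return sum(
        (c / n) * math.log(c * n / (count_x[x] * count_y[y]))
        for (x, y), c in joint.items()
    )
```

The plug-in estimator is known to be biased for small samples and fine binning, which is one reason the choice of estimator and bin count matters when comparing the upturn and downturn samples.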

Let $M_{U,n}$ and $M_{D,n}$ denote the mutual information of the returns of the
$n$th stock and index, estimated from the samples
$\{(\delta S_{n,t}, \delta I_{n,t})\}_{t \in U}$ and
$\{(\delta S_{n,t}, \delta I_{n,t})\}_{t \in D}$, respectively. We average over
$n$ to obtain the mean mutual information, which can be seen as a measure of the
degree of dependence between all the stocks over periods of upturns and
downturns, respectively, of the index $I$:

\[
M_U = \frac{1}{N} \sum_{n=1}^{N} M_{U,n}, \qquad
M_D = \frac{1}{N} \sum_{n=1}^{N} M_{D,n}.
\]

Figure 4 shows the mean mutual information for varying window length: there is
clearly a higher degree of dependence between the stock returns during index
downturns. However, given the hypothesis that stocks tend to "move together" to
a greater degree during index downturns, with the result that downturns are more
dramatic than upturns, there is a potential problem with the measure: the mutual
information between the $n$th stock and index is large whenever there is a high
degree of dependence, not only when they tend to move in the same direction. If
some stocks tend to move up when the index moves down, this would moderate the
downturns, contrary to our intuition, and yet result in high values for the mean
mutual information. For this reason, we also consider a correlation based
measure.

Mean correlation 

Let $C_{U,n}$ and $C_{D,n}$ denote the correlation between the returns of the
$n$th stock and index, estimated from the samples
$\{(\delta S_{n,t}, \delta I_{n,t})\}_{t \in U}$ and
$\{(\delta S_{n,t}, \delta I_{n,t})\}_{t \in D}$, respectively. We average over
$n$ to obtain the mean correlation, which can be seen as a measure of the degree
of dependence between all the stocks over periods of upturns and downturns,
respectively, of the index $I$:

\[
C_U = \frac{1}{N} \sum_{n=1}^{N} C_{U,n}, \qquad
C_D = \frac{1}{N} \sum_{n=1}^{N} C_{D,n}.
\]

Figure 4 shows the mean correlation for varying window length. This measure of
dependence between the stock returns also shows markedly higher values during
index downturns than during index upturns. Contrary to the mean mutual
information, however, the presence of "defensive" stocks that move up during
index downturns would give negative contributions.
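The mean correlation over a set of days can be sketched as below, using the index that leaves out the stock in question and the sample Pearson correlation. All function names are hypothetical, and the input layout (a list of price series, with day indices starting at 1) is an assumption for the example.

```python
import math


def log_returns(series):
    """Daily log returns; element t-1 is the return on day t."""
    return [math.log(series[t] / series[t - 1]) for t in range(1, len(series))]


def leave_one_out_index(stock_prices, n):
    """Average of S_{m,t} / S_{m,0} over all stocks m except stock n."""
    others = [s for m, s in enumerate(stock_prices) if m != n]
    if not others:
        raise ValueError("need at least two stocks")
    return [
        sum(s[t] / s[0] for s in others) / len(others)
        for t in range(len(others[0]))
    ]


def pearson(xs, ys):
    """Sample Pearson correlation (fails if either sample is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def mean_correlation(stock_prices, days):
    """Average over stocks of corr(stock return, leave-one-out index return),
    using only the returns on the given days (indices 1, ..., T)."""
    selected = sorted(days)
    total = 0.0
    for n, series in enumerate(stock_prices):
        ds = log_returns(series)
        di = log_returns(leave_one_out_index(stock_prices, n))
        total += pearson([ds[t - 1] for t in selected],
                         [di[t - 1] for t in selected])
    return total / len(stock_prices)
```

Evaluating `mean_correlation` once on the upturn days and once on the downturn days, for a range of window lengths, gives the two curves being compared. A stock that moves against the rest of the index contributes a negative term, which is the property that distinguishes this measure from mean mutual information.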

Conclusion 

If the gain/loss asymmetry observed for stock indices were a property of
the unconditional distribution of returns, then the phenomenon should remain
invariant under random permutations of the returns. As we have demonstrated,
this is not the case. We may begin to rely more confidently on expectations
derived from the generalized asymmetric synchronous market model, which has
previously demonstrated that differences in correlated movements in index
constituents for down-moves and up-moves can give rise to the kind of temporal
dependence structure that produces such asymmetry. However, there are practical
difficulties in exploring the correlations between the time series of the
individual constituents of real stock indices, since these are not readily
available, so we have shown that the gain/loss asymmetry can also be reproduced
in an artificial stock index constructed as a simple average of a number of
individual stocks.

Considering two different measures of dependence, mean mutual information
and mean correlation, we conclude that there indeed is a greater degree of
dependence between the constituents of the artificial index during downturns
than upturns. This part of our analysis can be seen as an attempt to overcome
some of the general difficulties in formulating tractable ways of analyzing
non-stationary dependence structure in multivariate stochastic processes. Future
work in the direction of analyzing the dynamics of the changes in the level of
dependence between asset prices would certainly be interesting, not least from
the perspective of investors who seek diversification that does not break down
at the worst possible time. For instance, is it possible to design a localized
measure of the level of dependence between stock prices and zoom in even more
on the points in time where it is changing?

Figure 4: The mean mutual information (left) and the mean correlation (right)
for the artificial index, as a function of the window length $L$. The graphs
show the mean mutual information and correlation corresponding to index upturns
(stars) and downturns (rings), respectively.